Ronald Fagin, Ravi Kumar, Kevin S. McCurley, Jasmine Novak, D. Sivakumar, John A. Tomlin, 
David P. Williamson 

May 2003 Proceedings of the twelfth international conference on World Wide Web 

Full text available: pdf(231.55 KB) Additional Information: full citation, abstract, references, citings, index terms



The social impact from the World Wide Web cannot be underestimated, but technologies 
used to build the Web are also revolutionizing the sharing of business and government 
information within intranets. In many ways the lessons learned from the Internet carry over 
directly to intranets, but others do not apply. In particular, the social forces that guide the 
development of intranets are quite different, and the determination of a "good answer" for 
intranet search is quite different than on the Int ... 

Fast detection of communication patterns in distributed executions 
Thomas Kunz, Michiel F. H. Seuren 

November 1997 Proceedings of the 1997 conference of the Centre for Advanced Studies 
on Collaborative research 

Full text available: pdf(4.21 MB) Additional Information: full citation, abstract, references, index terms

Understanding distributed applications is a tedious and difficult task. Visualizations based on 
process-time diagrams are often used to obtain a better understanding of the execution of 
the application. The visualization tool we use is Poet, an event tracer developed at the 
University of Waterloo. However, these diagrams are often very complex and do not provide 
the user with the desired overview of the application. In our experience, such tools display 
repeated occurrences of non-trivial commun ... 

Information retrieval session 7: web: Automated index management for distributed web 
search 

Rinat Khoussainov, Nicholas Kushmerick 

November 2003 Proceedings of the twelfth international conference on Information and 
knowledge management 

Full text available: pdf(207.09 KB) Additional Information: full citation, abstract, references, index terms

Distributed heterogeneous search systems are an emerging phenomenon in Web search, in 
which independent topic-specific search engines provide search services, and metasearchers 
distribute user's queries to only the most suitable search engines. Previous research has 
investigated methods for engine selection and merging of search results (i.e., performance 
improvements from the user's perspective). We focus instead on performance from the 
service provider's point of view (e.g., income from queries ... 

Keywords: distributed web search, reinforcement learning, stochastic game 

4 Tools and approaches for developing data-intensive Web applications: a survey
Piero Fraternali 

September 1999 ACM Computing Surveys (CSUR), volume 31 issue 3 

Full text available: pdf(524.80 KB) Additional Information: full citation, abstract, references, citings, index terms

The exponential growth and capillar diffusion of the Web are nurturing a novel generation of 
applications, characterized by a direct business-to-customer relationship. The development 
of such applications is a hybrid between traditional IS development and Hypermedia 
authoring, and challenges the existing tools and approaches for software production. This 
paper investigates the current situation of Web development tools, both in the commercial 
and research fields, by identifying and characte ... 

Keywords: HTML, Intranet, WWW, application, development 



5 Computing curricula 2001 
A survey of Web metrics

Devanshu Dhyani, Wee Keong Ng, Sourav S. Bhowmick 

December 2002 ACM Computing Surveys (CSUR), volume 34 issue 4 

Full text available: pdf(289.28 KB) Additional Information: full citation, abstract, references, index terms

The unabated growth and increasing significance of the World Wide Web has resulted in a 
flurry of research activity to improve its capacity for serving information more effectively. 
But at the heart of these efforts lie implicit assumptions about "quality" and "usefulness" of 
Web resources and services. This observation points towards measurements and models 
that quantify various attributes of web sites. The science of measuring all aspects of 
information, especially its storage and retrieval or ... 

Keywords: Information theoretic, PageRank, Web graph, Web metrics, Web page similarity, 
quality metrics 



7 Information retrieval on the web 
Mei Kobayashi, Koichi Takeda 

June 2000 ACM Computing Surveys (CSUR), volume 32 issue 2 

Full text available: pdf(213.89 KB) Additional Information: full citation, abstract, references, citings, index terms

In this paper we review studies of the growth of the Internet and technologies that are 
useful for information search and retrieval on the Web. We present data on the Internet 
from several different sources, e.g., current as well as projected number of users, hosts, 
and Web sites. Although numerical figures vary, overall trends cited by the sources are 
consistent and point to exponential growth in the past and in the coming decade. Hence it is 
not surprising that about 85% of Internet user ... 

Keywords: Internet, World Wide Web, clustering, indexing, information retrieval, 
knowledge management, search engine 



8 Object-based navigation: an intuitive navigation style for content-oriented integration
environment 

Kyoji Hirata, Sougata Mukherjea, Yusaku Okamura, Wen-Syan Li, Yoshinori Hara 
April 1997 Proceedings of the eighth ACM conference on Hypertext 

Full text available: pdf(1.29 MB) Additional Information: full citation, references, citings, index terms



Keywords: COIR, World-Wide Web, content-oriented integration, object-based navigation, 
object-level integration, relationship among objects 



9 Analysis of navigation behaviour in web sites integrating multiple information systems
Bettina Berendt, Myra Spiliopoulou 

March 2000 The VLDB Journal — The International Journal on Very Large Data Bases, 

Volume 9 Issue 1 

Full text available: pdf(281.14 KB) Additional Information: full citation, abstract, index terms

The analysis of web usage has mostly focused on sites composed of conventional static 
pages. However, huge amounts of information available in the web come from databases or 
other data collections and are presented to the users in the form of dynamically generated 
pages. The query interfaces of such sites allow the specification of many search criteria. 
Their generated results support navigation to pages of results combining cross-linked data 
from many sources. For the analysis of visitor naviga ... 

Keywords: Conceptual hierarchies, Data mining, Query capabilities, Web databases, Web 
query interfaces, Web usage mining 



10 An architecture for secure wide-area service discovery 

Todd D. Hodes, Steven E. Czerwinski, Ben Y. Zhao, Anthony D. Joseph, Randy H. Katz 
March 2002 Wireless Networks, volume 8 issue 2/3 

Full text available: pdf(365.68 KB) Additional Information: full citation, abstract, references, index terms

The widespread deployment of inexpensive communications technology, computational 
resources in the networking infrastructure, and network-enabled end devices poses an 
interesting problem for end users: how to locate a particular network service or device out 
of hundreds of thousands of accessible services and devices. This paper presents the 
architecture and implementation of a secure wide-area Service Discovery Service (SDS). 
Service providers use the SDS to advertise descriptions of available ... 

Keywords: location services, name lookup, network protocols, service discovery 




11 Image Retrieval from the World Wide Web: Issues, Techniques, and Systems 
M. L. Kherfi, D. Ziou, A. Bernardi 

March 2004 ACM Computing Surveys (CSUR), volume 36 issue 1 

Full text available: pdf(294.13 KB) Additional Information: full citation, abstract, references, index terms

With the explosive growth of the World Wide Web, the public is gaining access to massive 
amounts of information. However, locating needed and relevant information remains a 
difficult task, whether the information is textual or visual. Text search engines have existed 
for some years now and have achieved a certain degree of success. However, despite the 
large number of images available on the Web, image search engines are still rare. In this 
article, we show that in order to allow people to profi ... 

Keywords: Image-retrieval, World Wide Web, crawling, feature extraction and selection, 
indexing, relevance feedback, search, similarity 

12 Web searching: Specialisation dynamics in federated web search 
Rinat Khoussainov, Nicholas Kushmerick 

November 2004 Proceedings of the 6th annual ACM international workshop on Web 
information and data management 

Full text available: pdf(138.32 KB) Additional Information: full citation, abstract, references, index terms

Organising large-scale Web information retrieval systems into hierarchies of topic-specific 
search resources can improve both the quality of results and the efficient use of computing 
resources. A promising way to build such systems involves federations of topic-specific 
search engines in decentralised search environments. Most of the previous research 
concentrated on various technical aspects of such environments (e.g. routing of search 
queries or merging of results from multiple sources). W ... 

Keywords: competition, federated web search, topic specialisation 

13 VideoQ: an automated content based video search system using visual cues 
Shih-Fu Chang, William Chen, Horace J. Meng, Hari Sundaram, Di Zhong 

November 1997 Proceedings of the fifth ACM international conference on Multimedia 

Full text available: pdf(1.67 MB) Additional Information: full citation, references, citings, index terms



14 Managing routing tables for URL routers in content distribution networks 
Zornitza Genova Prodanoff, Kenneth J. Christensen 

May 2004 International Journal of Network Management, volume 14 issue 3 

Full text available: pdf(337.00 KB) Additional Information: full citation, abstract, references, index terms

Large-scale content distribution networks (CDNs) can be built using URL routers to redirect 
client HTTP requests to the nearest content source. URL routers employ very large routing 
tables. To improve the manageability of CDNs, we propose to use URL signatures to reduce 
the size of routing tables and aggressive hashing to speed up routing look-ups. 

15 Geospatial mapping and navigation of the web 
Kevin S. McCurley 

April 2001 Proceedings of the tenth international conference on World Wide Web 

Full text available: pdf(1.06 MB) Additional Information: full citation, references, citings, index terms



Keywords: browsers, geographic information systems, geospatial information retrieval, 
navigation 
Luis Gravano, Hector Garcia-Molina, Anthony Tomasic 

May 1994 ACM SIGMOD Record, Proceedings of the 1994 ACM SIGMOD international 
conference on Management of data, volume 23 issue 2 

Full text available: pdf(1.36 MB) Additional Information: full citation, abstract, references, citings, index terms

The popularity of on-line document databases has led to a new problem: finding which text 
databases (out of many candidate choices) are the most relevant to a user. Identifying the 
relevant databases for a given query is the text database discovery problem. The first part 
of this paper presents a practical solution based on estimating the result size of a query and 
a database. The method is termed GlOSS (Glossary of Servers Server). The second part of 
t ... 

17 Visualization: Periscope: a system for adaptive 3D visualization of search results 
Wojciech Wiza, Krzysztof Walczak, Wojciech Cellary 

April 2004 Proceedings of the ninth international conference on 3D Web technology 

Full text available: pdf(1.37 MB) Additional Information: full citation, abstract, references, index terms

A system for efficient 3D visualization of Web search results is presented. The system, called 
Periscope, uses a novel approach for adaptive and customizable visualization of complex 
data. The whole process is divided into a number of interactive steps. At each step, the 
system can automatically choose the best method of presenting search results. The user can 
also select a specific presentation method to focus on certain properties of the result 
obtained. After analyzing the current search res ... 

Keywords: adaptive interfaces, human-computer interfaces, virtual reality 



18 P1: "Yes, but does it scale?": practical considerations for database-driven information 
systems 

John Russell 

October 2001 Proceedings of the 19th annual international conference on Computer 
documentation 

Full text available: pdf(231.31 KB) Additional Information: full citation, abstract, references, citings, index terms

This paper explores the process of designing and implementing a database-driven system of 
online documentation, and putting it live on the web for customers to use. Using real-life 
examples, it discusses practical considerations for balancing performance, scalability, and 
reliability. 

Keywords: Oracle, automation, categorization, database, performance, reliability, 
scalability, web services 

19 ScentTrails: Integrating browsing and searching on the Web 
Christopher Olston, Ed H. Chi 

September 2003 ACM Transactions on Computer-Human Interaction (TOCHI), volume 10 issue 3 

Full text available: pdf(654.98 KB) Additional Information: full citation, abstract, references, index terms, review

The two predominant paradigms for finding information on the Web are browsing and 
keyword searching. While they exhibit complementary advantages, neither paradigm alone 
is adequate for complex information goals that lend themselves partially to browsing and 
partially to searching. To integrate browsing and searching smoothly into a single interface, 
we introduce a novel approach called ScentTrails. Based on the concept of information scent 
developed in the context of information foraging theory, ... 
Keywords: ScentTrails, World Wide Web, browsing, information scent, searching 



20 Web search 2: Using micro information units for internet search 
Xiaoli Li, Tong-Heng Phang, Minqing Hu, Bing Liu 

November 2002 Proceedings of the eleventh international conference on Information 
and knowledge management 

Full text available: pdf(572.32 KB) Additional Information: full citation, abstract, references, index terms

Internet search is one of the most important applications of the Web. A search engine takes 
the user's keywords to retrieve and to rank those pages that contain the keywords. One 
shortcoming of existing search techniques is that they do not give due consideration to the 
micro-structures of a Web page. A Web page is often populated with a number of small 
information units, which we call micro information units (MIU). Each unit focuses on a 
specific topic and occupies a specific area of the ... 

Keywords: micro information units, web page segmentation, web search 